The TASX-environment: an XML-based toolset for time aligned speech corpora
نویسندگان
چکیده
This paper describes the design and implementation of an XML-based corpus environment for multi-tier annotated speech data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the speech data, all transformation of non XML-annotations, and the web-based analysis and dissemination of the data.
منابع مشابه
A Prosodic Corpus of Non-Native Speech
The paper describes the design and implementation of an XML-based corpus environment for prosodically annotated data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the speec...
متن کاملThe TASX-environment: an XML-based corpus database for time aligned language data
The paper describes the design and implementation of an XML-based corpus environment for time aligned language/signal data. The TASX-environment constitutes the technical basis for a phonetic corpus designed to explore the acquisition of prosody by second language learners.
متن کاملQuerying Annotated Speech Corpora
This paper is concerned with querying annotated speech corpora. A growing number of such corpora is currently being created worldwide; however, their usefulness for a wider research community is restricted by the lack of standard tools for creating, editing, annotating, storing and querying them. Two solutions for these problems are presented here: the XML-based data format TASX for corpus crea...
متن کاملMultimodale bilinguale Korpora gesprochener Sprache: Korpuserstellung, -analyse und -dissemination in der TASX-Umgebung
Zusammenfassung: Dieser Beitrag beschreibt die TASX-Korpusumgebung, ein XMLbasiertes System zur Erstellung und Auswertung von großen Sprachkorpora. Das Time Aligned Signal Data Exchange Format (TASX) wurde speziell entwickelt für die Annotation zeitlich geordneter, multimodaler Sprachdaten. Anhand zweier exemplarisch vorgestellter multimodaler, multilingualer Korpora gesprochener Sprache werden...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002